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TITLE OF THE INVENTION 
PROTEIN-PROTEIN INTERACTIONS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] The present application is related to U.S. provisional patent application Serial No. 
60/256,986, filed on 21 December 2000, incorporated herein by reference, and claims priority 
thereto under 35 USC § 119(e). 

BACKGROUND OF THE INVENTION 

[0002] The present invention relates to the discovery of novel protein-protein interactions 
that are involved in mammalian physiological pathways, including physiological disorders or 
diseases. Examples of physiological disorders and diseases include non-insulin dependent diabetes 
mellitus (NIDDM), neurodegenerative disorders, such as Alzheimer's Disease (AD), and the like. 
Thus, the present invention is directed to complexes of these proteins and/or their fragments, 
antibodies to the complexes, diagnosis of physiological generative disorders (including diagnosis 
of a predisposition to and diagnosis of the existence of the disorder), drug screening for agents 
which modulate the interaction of proteins described herein, and identification of additional proteins 
in the pathway common to the proteins described herein. 

[0003] The publications and other materials used herein to illuminate the background of the 
invention, and in particular, cases to provide additional details respecting the practice, are 
incorporated herein by reference, and for convenience, are referenced by author and date in the 
following text and respectively grouped in the appended Bibliography. 

[0004] Many processes in biology, including transcription, translation and metabolic or 
signal transduction pathways, are mediated by non-covalently associated protein complexes. The 
formation of protein-protein complexes or protein-DNA complexes produce the most efficient 
chemical machinery. Much of modern biological research is concerned with identifying proteins 
involved in cellular processes, determining their functions, and how, when and where they interact 
with other proteins involved in specific pathways. Further, with rapid advances in genome 
sequencing, there is a need to define protein linkage maps, i.e., detailed inventories of protein 
interactions that make up functional assemblies of proteins or protein complexes or that make up 
physiological pathways. 

[0005] Recent advances in human genomics research has led to rapid progress in the 
identification of novel genes. In applications to biological and pharmaceutical research, there is a 
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need to determine functions of gene products. A first step in defining the function of a novel gene 
is to determine its interactions with other gene products in appropriate context. That is, since 
proteins make specific interactions with other proteins or other biopolymers as part of functional 
assemblies or physiological pathways, an appropriate way to examine function of a gene is to 
5 determine its physical relationship with other genes. Several systems exist for identifying protein 
interactions and hence relationships between genes. 

[0006] There continues to be a need in the art for the discovery of additional protein-protein 
interactions involved in mammalian physiological pathways. There continues to be a need in the 
art also to identify the protein-protein interactions that are involved in mammalian physiological 
10 disorders and diseases, and to thus identify drug targets. 

?■% 

11 SUMMARY OF THE INVENTION 

7* [0007] The present invention relates to the discovery of protein-protein interactions that are 

Vj involved in mammalian physiological pathways, including physiological disorders or diseases, and 
s|J5 to the use of this discovery. The identification of the interacting proteins described herein provide 
1^ new targets for the identification of useful pharmaceuticals, new targets for diagnostic tools in the 
jj; identification of individuals at risk, sequences for production of transformed cell lines, cellular 
M ! models and animal models, and new bases for therapeutic intervention in such physiological 
jl' pathways 

20 [0008] Thus, one aspect of the present invention is protein complexes. The protein 

complexes are a complex of (a) two interacting proteins, (b) a first interacting protein and a fragment 
of a second interacting protein, (c) a fragment of a first interacting protein and a second interacting 
protein, or (d) a fragment of a first interacting protein and a fragment of a second interacting protein. 
The fragments of the interacting proteins include those parts of the proteins, which interact to form 

25 a complex. This aspect of the invention includes the detection of protein interactions and the 
production of proteins by recombinant techniques. The latter embodiment also includes cloned 
sequences, vectors, transfected or transformed host cells and transgenic animals. 

[0009] A second aspect of the present invention is an antibody that is immunoreactive with 
the above complex. The antibody may be a polyclonal antibody or a monoclonal antibody. While 

30 the antibody is immunoreactive with the complex, it is not immunoreactive with the component 
parts of the complex. That is, the antibody is not immunoreactive with a first interactive protein, 
a fragment of a first interacting protein, a second interacting protein or a fragment of a second 
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-interacting protein. Such antibodies can be used to detect the presence or absence of the protein 
complexes. 

[0010] A third aspect of the present invention is a method for diagnosing a predisposition 
for physiological disorders or diseases in a human or other animal. The diagnosis of such disorders 
5 includes a diagnosis of a predisposition to the disorders and a diagnosis for the existence of the 
disorders. In accordance with this method, the ability of a first interacting protein or fragment 
thereof to form a complex with a second interacting protein or a fragment thereof is assayed, or the 
genes encoding interacting proteins are screened for mutations in interacting portions of the protein 
molecules. The inability of a first interacting protein or fragment thereof to form a complex, or the 
1 0 presence of mutations in a gene within the interacting domain, is indicative of a predisposition to, 
M or existence of a disorder. In accordance with one embodiment of the invention, the ability to form 
f i a complex is assayed in a two-hybrid assay. In a first aspect of this embodiment, the ability to form 
a complex is assayed by a yeast two-hybrid assay. In a second aspect, the ability to form a complex 
W"! is assayed by a mammalian two-hybrid assay. In a second embodiment, the ability to form a 
;0 5 complex is assayed by measuring in vitro a complex formed by combining said first protein and said 
L : second protein. In one aspect the proteins are isolated from a human or other animal. In a third 
JH embodiment, the ability to form a complex is assayed by measuring the binding of an antibody, 
M which is specific for the complex. In a fourth embodiment, the ability to form a complex is assayed 
y, by measuring the binding of an antibody that is specific for the complex with a tissue extract from 
20 a human or other animal. In a fifth embodiment, coding sequences of the interacting proteins 
described herein are screened for mutations. 

[001 1] A fourth aspect of the present invention is a method for screening for drug candidates 
which are capable of modulating the interaction of a first interacting protein and a second interacting 
protein. In this method, the amount of the complex formed in the presence of a drug is compared 
25 with the amount of the complex formed in the absence of the drug. If the amount of complex 
formed in the presence of the drug is greater than or less than the amount of complex formed in the 
absence of the drug, the drug is a candidate for modulating the interaction of the first and second 
interacting proteins. The drug promotes the interaction if the complex formed in the presence of the 
drug is greater and inhibits (or disrupts) the interaction if the complex formed in the presence of the 
30 drug is less. The drug may affect the interaction directly, i.e., by modulating the binding of the two 
proteins, or indirectly, e.g., by modulating the expression of one or both of the proteins. 
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[0012] A fifth aspect of the present invention is a model for such physiological pathways, 
disorders or diseases. The model may be a cellular model or an animal model, as further described 
herein. In accordance with one embodiment of the invention, an animal model is prepared by 
creating transgenic or "knock-out" animals. The knock-out may be a total knock-out, i.e., the 
5 desired gene is deleted, or a conditional knock-out, i.e., the gene is active until it is knocked out at 
a determined time. In a second embodiment, a cell line is derived from such animals for use as a 
model. In a third embodiment, an animal model is prepared in which the biological activity of a 
protein complex of the present invention has been altered. In one aspect, the biological activity is 
altered by disrupting the formation of the protein complex, such as by the binding of an antibody 
10 or small molecule to one of the proteins which prevents the formation of the protein complex. In 
ys. a second aspect, the biological activity of a protein complex is altered by disrupting the action of 
I the complex, such as by the binding of an antibody or small molecule to the protein complex which 
: '; interferes with the action of the protein complex as described herein. In a fourth embodiment, a cell 

yi model is prepared by altering the genome of the cells in a cell line. In one aspect^the genome of 

0 

% {|5 the cells is modified to produce at least one protein complex described herein. In a second aspect, 
f the genome of the cells is modified to eliminate at least one protein of the protein complexes 
t - described herein. 

%s> [0013] A sixth aspect of the present invention are nucleic acids coding for novel proteins 

f s discovered in accordance with the present invention and the corresponding proteins and antibodies. 
20 [0014] A seventh aspect of the present invention is a method of screening for drug 

candidates useful for treating a physiological disorder. In this embodiment, drugs are screened on 
the basis of the association of a protein with a particular physiological disorder. This association 
is established in accordance with the present invention by identifying a relationship of the protein 
with a particular physiological disorder. The drugs are screened by comparing the activity of the 
25 protein in the presence and absence of the drug. If a difference in activity is found, then the drug 
is a drug candidate for the physiological disorder. The activity of the protein can be assayed in vitro 
or in vivo using conventional techniques, including transgenic animals and cell lines of the present 
invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

[0015] The present invention is the discovery of novel interactions between proteins 
described herein. The genes coding for some of these proteins may have been cloned previously, 
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but their potential interaction in a physiological pathway or with a particular protein was unknown. 
Alternatively, the genes coding for some of these proteins have not been cloned previously and 
represent novel genes. These proteins are identified using the yeast two-hybrid method and 
searching a human total brain library, as more fully described below. 
5 [0016] According to the present invention, new protein-protein interactions have been 

discovered. The discovery of these interactions has identified several protein complexes for each 
protein-protein interaction. The protein complexes for these interactions are set forth below in 
Tables 1-2, which also identifies the new protein-protein interactions of the present invention. 



10 TABLE 1 

jds Protein Complexes ER-alpha/PN12364 Interaction 

~p- Estrogen receptor l(ER-alpha) and PN12364 

W A fragment of ER-alpha and PN12364 

|j1 ER-alpha and a fragment of PN12364 

A 5 A fragment of ER-alpha and a fragment of PN1 23 64 

? J TABLE 2 

P Protein Complexes ER-beta/PN 12365 Interaction 

l: Estrogen receptor 2(ER-beta) and PN12365 

20 A fragment of ER-alpha and PN12365 

ER-alpha and a fragment of PN12365 
A fragment of ER-alpha and a fragment of PN12365 



[0017] The involvement of above interactions in particular pathways is as follows. 

25 [001 8] Many cellular proteins exert their function by interacting with other proteins in the 

cell. Examples of this are found in the formation of multiprotein complexes and the association of 
enzymes with their substrates. It is widely believed that a great deal of information can be gained 
by understanding individual protein-protein interactions, and that this is useful in identifying 
complex networks of interacting proteins that participate in the workings of normal cellular 

30 functions. Ultimately, the knowledge gained by characterizing these networks can lead to valuable 
insight into the causes of human diseases and can eventually lead to the development of therapeutic 
strategies. The yeast two-hybrid assay is a powerful tool for determining protein-protein 
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interactions and it has been successfully used for studying human disease pathways. In one 
variation of this technique, a protein of interest (or a portion of that protein) is expressed in a 
population of yeast cells that collectively contain all protein sequences. Yeast cells that possess 
protein sequences that interact with the protein of interest are then genetically selected, and the 
5 identity of those interacting proteins are determined by DNA sequencing. Thus, proteins that can 
be demonstrated to interact with a protein known to be involved in a human disease are therefore 
also implicated in that disease. Proteins identified in the first round of two-hybrid screening can be 
subsequently used in a second round of two-hybrid screening, allowing the identification of multiple 
proteins in the complex network of interactions in a disease pathway. 
10 [001 9] Nuclear hormone receptors play important roles in development, reproduction, and 

§.-& physiology by altering gene transcription in response to hormonal signals (Whitfield et al., 1999; 
5;; Klein-Hitpass et al, 1998). Misregulation of hormone receptor signaling pathways is responsible 
W for a variety of diseases. For example, aldosterone and its receptor (the mineralocorticoid receptor, 
|j1 MCR) are involved in hypertension and congestive heart failure (Duprez et al., 2000), and it has 
a|5 recently been shown that a missense mutation in MCR that alters its ligand specificity is responsible 
f for pregnancy-exacerbated hypertension (Geller et al., 2000). Likewise, glucocorticoids and the 

f- glucocorticoid receptor (GR) have been implicated in chronic inflammation and arthritis (Banres, 

flJ 

I,* 1998), and the oxysterol liver receptor (LXR), farnesoid X receptor (FXR), and other nuclear 

'H receptors are involved in cholesterol homeostasis and atherogenesis (Schroepfer, 2000; Haynes et 

20 al., 2000; Brown and Jessup, 1 999) 

[0020] Collectively, the nuclear receptor superfamily is responsive to a wide variety of 
ligands. Nuclear hormone receptors share several important structural features, including a variable 
N-terminal region, a conserved central DNA-binding domain, a variable hinge region, and a 
conserved C-terminal ligand-binding domain (Moras and Gronemeyer, 1998; Mangelsdorf et al., 

25 1995). Despite this conserved structural organization, interactions between ligands and receptors 
are remarkably specific. Hormone binding results in conformational changes in the receptor, 
allowing binding to specific DNA sequences (hormone response elements, HREs) in target gene 
promoters resulting in changes in target gene transcription. Interaction of nuclear hormone receptors 
with accessory proteins determines whether the receptor activates or represses transcription. 

30 Receptors can recruit coactivators that remodel chromatin and stabilize the RNA polymerase 
machinery, or alternatively can interact with factors that condense chromatin structure and inactivate 
gene expression (Wolffe et al, 1997). Furthermore, binding of a nuclear hormone receptor to other 



cellular proteins can alter the subcellular localization of the receptor and control its ability to bind 
hormone and HREs (DeFranco et al., 1998). Clearly, identification of factors with which nuclear 
hormone receptors interact is vital to understanding the process by which hormonal signals are 
transduced into transcriptional responses. In addition, identification of receptor-interacting proteins 
will increase the repertoire of potential targets for therapeutic intervention in the treatment of 
diseases due to defects involving nuclear hormone signaling. 

[0021] Several nuclear hormone receptors were used as bait in yeast two-hybrid searches, 
and as a result novel interactions between these receptors and a number of extracellular proteins 
were identified. The number of interactions between different nuclear hormone receptors and a 
variety of extracellular proteins suggest these interactions may play a role in regulating the 
transactivation activity of nuclear receptors in response to hormonal signals. 

[0022] We have identified an interaction two novel proteins (PN12364 and PN12365) with 
homology to an extracellular protein were found to interact with ER-alpha and ER-beta, 
respectively. PN12364 and PN12365 are 97% and 95% identical (respectively) at the amino acid 
level to human Notch2 (GenBank AF3 15356). 

[0023] The alpha and beta isoforms of human estrogen receptor (ERa and ERb) are nuclear 
hormone receptors that display sequence similarity to the glucocorticoid receptor (GR) and function 
as homodimers to regulate transcription in response to 17-beta-estradiol. Mutations in ERa have 
been implicated in the development and progression of breast cancer (Clark and McGuire, 1988; 
McGuire et al., 1991) and ERa and ERb are implicated in pituitary adenomas (Shupnik et al., 1998). 
ER activity appears to be modulated by phosphorylation at specific residues by the cyclin A-CDK2 
complex (Rogatsky et al., 1999) and by interaction with other cellular proteins such as rho GTPases 
and (Su et al, 2000; Knoblauch and Garabedian, 1999). 

[0024] The interaction of nuclear hormone receptors with putative extracellular proteins 
does not necessarily imply that the nuclear receptor is localized extracellularly. It is clear that some 
extracellular proteins exist at low levels within the cytoplasm, and even those destined for transport 
outside the cell exist transiently within the cytoplasm. Thus, it is possible that the interaction 
between the nuclear hormone receptors and these proteins results in sequestration of the receptor 
in a non-nuclear compartment, which would in turn affect the ability of the receptor to regulation 
transcription; such a role has been postulated for the interaction of nuclear hormone receptors and 
various heat shock proteins (DeFranco et al., 1998). 
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[0025] The proteins disclosed in the present invention were found to interact with their 
corresponding proteins in the yeast two-hybrid system. Because of the involvement of the corresponding 
proteins in the physiological pathways disclosed herein, the proteins disclosed herein also participate in 
the same physiological pathways. Therefore, the present invention provides a list of uses of these proteins 
5 and DNA encoding these proteins for the development of diagnostic and therapeutic tools useful in the 
physiological pathways. This list includes, but is not limited to, the following examples. 

Two-Hybrid System 

[0026] The principles and methods of the yeast two-hybrid system have been described in 
1 0 detail elsewhere (e.g., Bartel and Fields, 1 997; Bartel et al., 1 993; Fields and Song, 1 989; Chevray 
§,s and Nathans, 1 992). The following is a description of the use of this system to identify proteins that 

|5 interact with a protein of interest. 

[0027] The target protein is expressed in yeast as a fusion to the DNA-binding domain of 
If! the yeast Gal4p. DNA encoding the target protein or a fragment of this protein is amplified from 

.15 cDNA by PCR or prepared from an available clone. The resulting DNA fragment is cloned by 
* . ligation or recombination into a DNA-binding domain vector (e.g., pGBT9, pGBT.C, pAS2-l) such 

fy that an in-frame fusion between the Gal4p and target protein sequences is created. 

HJ 

%a [0028] The target gene construct is introduced, by transformation, into a haploid yeast strain. 

I": 

Tz A library of activation domain fusions (i.e., adult brain cDNA cloned into an activation domain 

20 vector) is introduced by transformation into a haploid yeast strain of the opposite mating type. The 
yeast strain that carries the activation domain constructs contains one or more Gal4p-responsive 
reporter gene(s), whose expression can be monitored. Examples of some yeast reporter strains 
include Y190, PJ69, and CBY14a. An aliquot of yeast carrying the target gene construct is 
combined with an aliquot of yeast carrying the activation domain library. The two yeast strains mate 
25 to form diploid yeast and are plated on media that selects for expression of one or more Gal4p- 
responsive reporter genes. Colonies that arise after incubation are selected for further 
characterization. 

[0029] The activation domain plasmid is isolated from each colony obtained in the two- 
hybrid search. The sequence of the insert in this construct is obtained by the dideoxy nucleotide 
30 chain termination method. Sequence information is used to identify the gene/protein encoded by the 
activation domain insert via analysis of the public nucleotide and protein databases. Interaction of 
the activation domain fusion with the target protein is confirmed by testing for the specificity of the 



9 

interaction. The activation domain construct is co-transformed into a yeast reporter strain with either 
the original target protein construct or a variety of other DNA-binding domain constructs. 
Expression of the reporter genes in the presence of the target protein but not with other test proteins 
indicates that the interaction is genuine. 
5 [0030] hi addition to the yeast two-hybrid system, other genetic methodologies are available 

for the discovery or detection of protein-protein interactions. For example, a mammalian two-hybrid 
system is available commercially (Clontech, Inc.) that operates on the same principle as the yeast 
two-hybrid system. Instead of transforming a yeast reporter strain, plasmids encoding DNA-binding 
and activation domain fusions are transfected along with an appropriate reporter gene (e.g., lacZ) 
1 0 into a mammalian tissue culture cell line. Because transcription factors such as the Saccharomyces 
cerevisiae Gal4p are functional in a variety of different eukaryotic cell types, it would be expected 
f\ that a two-hybrid assay could be performed in virtually any cell line of eukaryotic origin (e.g., insect 
% cells (SF9), fungal cells, worm cells, etc.). Other genetic systems for the detection of protein-protein 
lil interactions include the so-called SOS recruitment system (Aronheim et al., 1997). 

C 15 

* Protein-protein interactions 

If [003 1] Protein interactions are detected in various systems including the yeast two-hybrid 

§?* system, affinity chromatography, co-immunoprecipitation, subcellular fractionation and isolation 
Z! °f large molecular complexes. Each of these methods is well characterized and can be readily 

20 performed by one skilled in the art. See, e.g., U.S. Patents No. 5,622,852 and 5,773,21 8, and PCT 

published applications No. WO 97/27296 and WO 99/65939, each of which are incorporated herein 

by reference. 

[0032] The protein of interest can be produced in eukaryotic or prokaryotic systems. A 
cDNA encoding the desired protein is introduced in an appropriate expression vector and transfected 

25 in a host cell (which could be bacteria, yeast cells, insect cells, or mammalian cells). Purification 
of the expressed protein is achieved by conventional biochemical and immunochemical methods 
well known to those skilled in the art. The purified protein is then used for affinity chromatography 
studies: it is immobilized on a matrix and loaded on a column. Extracts from cultured cells or 
homogenized tissue samples are then loaded on the column in appropriate buffer, and non-binding 

30 proteins are eluted. After extensive washing, binding proteins or protein complexes are eluted using 
various methods such as a gradient of pH or a gradient of salt concentration. Eluted proteins can 
then be separated by two-dimensional gel electrophoresis, eluted from the gel, and identified by 
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micro-sequencing. The purified proteins can also be used for affinity chromatography to purify 
interacting proteins disclosed herein. All of these methods are well known to those skilled in the 
art. 

[0033] Similarly, both proteins of the complex of interest (or interacting domains thereof) 
5 can be produced in eukaryotic or prokaryotic systems. The proteins (or interacting domains) can 
be under control of separate promoters or can be produced as a fusion protein. The fusion protein 
may include a peptide linker between the proteins (or interacting domains) which, in one 
embodiment, serves to promote the interaction of the proteins (or interacting domains). All of these 
methods are also well known to those skilled in the art. 
1 0 [0034] Purified proteins of interest, individually or a complex, can also be used to generate 

antibodies in rabbit, mouse, rat, chicken, goat, sheep, pig, guinea pig, bovine, and horse. The 
methods used for antibody generation and characterization are well known to those skilled in the 
art. Monoclonal antibodies are also generated by conventional techniques. Single chain antibodies 
are further produced by conventional techniques. 
15 [0035] DNA molecules encoding proteins of interest can be inserted in the appropriate 

expression vector and used for transfection of eukaryotic cells such as bacteria, yeast, insect cells, 
or mammalian cells, following methods well known to those skilled in the art. Transfected cells 
expressing both proteins of interest are then lysed in appropriate conditions, one of the two proteins 
is immunoprecipitated using a specific antibody, and analyzed by polyacrylamide gel 
20 electrophoresis. The presence of the binding protein (co-immunoprecipitated) is detected by 
immunoblotting using an antibody directed against the other protein. Co-immunoprecipitation is 
a method well known to those skilled in the art. 

[0036] Transfected eukaryotic cells or biological tissue samples can be homogenized and 
fractionated in appropriate conditions that will separate the different cellular components. Typically, 
25 cell lysates are run on sucrose gradients, or other materials that will separate cellular components 
based on size and density. Subcellular fractions are analyzed for the presence of proteins of interest 
with appropriate antibodies, using immunoblotting or immunoprecipitation methods. These methods 
are all well known to those skilled in the art. 

3 0 Disruption of protein-protein interactions 

[0037] It is conceivable that agents that disrupt protein-protein interactions can be beneficial 
in many physiological disorders, including, but not-limited to NTDDM, AD and others disclosed 



11 

herein. Each of the methods described above for the detection of a positive protein-protein 
interaction can also be used to identify drugs that will disrupt said interaction. As an example, cells 
transfected with DNAs coding for proteins of interest can be treated with various drugs, and co- 
immunoprecipitations can be performed. Alternatively, a derivative of the yeast two-hybrid system, 
called the reverse yeast two-hybrid system (Leanna and Hannink, 1996), can be used, provided that 
the two proteins interact in the straight yeast two-hybrid system. 

Modulation of protein-protein interactions 

[0038] Since the interactions described herein are involved in a physiological pathway, the 
identification of agents which are capable of modulating the interactions will provide agents which 
can be used to track physiological disorder or to use lead compounds for development of therapeutic 
agents. An agent may modulate expression of the genes of interacting proteins, thus affecting 
interaction of the proteins. Alternatively, the agent may modulate the interaction of the proteins. 

The agent may modulate the interaction of wild-type with wild-type proteins, wild-type with mutant 
proteins, or mutant with mutant proteins. Agents which may be used to modulate the protein 
interaction inlcude a peptide, an antibody, a nucleic acid, an antisense compound or a ribozyme. 

The nucleic acid may encode the antibody or the antisense compound. The peptide may be at least 
4 amino acids of the sequence of either of the interacting proteins. Alternatively, the peptide may 
be from 4 to 30 amino acids (or from 8 to 20 amino acids) that is at least 75% identical to a 
contiguous span of amino acids of either of the interacting proteins. The peptide may be covalently 
linked to a transporter capable of increasing cellular uptake of the peptide. Examples of a suitable 
transporter include penetratins, /-Tat 49 . 57 , <2-Tat 49 _ 57 , retro-inverso isomers of /- or d-Tat 49 _ S7 , L- 
arginine oligomers, D- arginine oligomers, L-lysine oligomers, D-lysine oligomers, L-histine 
oligomers, D-histine oligomers, L-ornithine oligomers, D-ornithine oligomers, short peptide 
sequences derived from fibroblast growth factor, Galparan, and HSV-1 structural protein VP22, and 
peptoid analogs thereof. Agents can be tested using transfected host cells, cell lines, cell models or 
animals, such as described herein, by techniques well known to those of ordinary skill in the art, such 
as disclosed in U.S. Patents Nos. 5,622,852 and 5,773,218, and PCT published application Nos. WO 
97/27296 and WO 99/65939, each of which are incorporated herein by reference. The modulating 
effect of the agent can be tested in vivo or in vitro. Agents can be provided for testing in a phage 
display library or a combinatorial library. Exemplary of a method to screen agents is to measure 
the effect that the agent has on the formation of the protein complex. 
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Mutation screening 

[0039] The proteins disclosed in the present invention interact with one or more proteins 
known to be involved in a physiological pathway, such as in NIDDM, AD or pathways described 
herein. Mutations in interacting proteins could also be involved in the development of the 
physiological disorder, such as NIDDM, AD or disorders described herein, for example, through 
a modification of protein-protein interaction, or a modification of enzymatic activity, modification 
of receptor activity, or through an unknown mechanism. Therefore, mutations can be found by 
sequencing the genes for the proteins of interest in patients having the physiological disorder, such 
as insulin, and non-affected controls. A mutation in these genes, especially in that portion of the 
gene involved in protein interactions in the physiological pathway, can be used as a diagnostic tool 
and the mechanistic understanding the mutation provides can help develop a therapeutic tool. 

Screening for at-risk individuals 

[0040] Individuals can be screened to identify those at risk by screening for mutations in the 
protein disclosed herein and identified as described above. Alternatively, individuals can be 
screened by analyzing the ability of the proteins of said individual disclosed herein to form natural 
complexes. Further, individuals can be screened by analyzing the levels of the complexes or 
individual proteins of the complexes or the mRNA encoding the protein members of the complexes. 
Techniques to detect the formation of complexes, including those described above, are known to 
those skilled in the art. Techniques and methods to detect mutations are well known to those skilled 
in the art. Techniques to detect the level of the complexes, proteins or mRNA are well known to 
those skilled in the art. 

Cellular models of Physiological Disorders 

[0041] A number of cellular models of many physiological disorders or diseases have been 
generated. The presence and the use of these models are familiar to those skilled in the art. As an 
example, primary cell cultures or established cell lines can be transfected with expression vectors 
encoding the proteins of interest, either wild-type proteins or mutant proteins. The effect of the 
proteins disclosed herein on parameters relevant to their particular physiological disorder or disease 
can be readily measured. Furthermore, these cellular systems can be used to screen drugs that will 
influence those parameters, and thus be potential therapeutic tools for the particular physiological 
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disorder or disease. Alternatively, instead of transfecting the DNA encoding the protein of interest, 
the purified protein of interest can be added to the culture medium of the cells under examination, 
and the relevant parameters measured. 

5 Animal models 

[0042] The DNA encoding the protein of interest can be used to create animals that 
overexpress said protein, with wild-type or mutant sequences (such animals are referred to as 
"transgenic"), or animals which do not express the native gene but express the gene of a second 
animal (referred to as "transplacement"), or animals that do not express said protein (referred to as 
10 "knock-out"). The knock-out animal may be an animal in which the gene is knocked out at a 

I s * determined time. The generation of transgenic, transplacement and knock-out animals (normal and 

£3 

r 3 conditioned) uses methods well known to those skilled in the art. 

1 '! [0043] In these animals, parameters relevant to the particular physiological disorder can be 

4* 

W measured. These parametes may include receptor function, protein secretion in vivo or in vitro, 
=j~T 5 survival rate of cultured cells, concentration of particular protein in tissue homogenates, signal 
J s transduction, behavioral analysis, protein synthesis, cell cycle regulation, transport of compounds 
[H across cell or nuclear membranes, enzyme activity, oxidative stress, production of pathological 
§=* products, and the like. The measurements of biochemical and pathological parameters, and of 
ju behavioral parameters, where appropriate, are performed using methods well known to those skilled 
20 in the art. These transgenic, transplacement and knock-out animals can also be used to screen drugs 
that may influence the biochemical, pathological, and behavioral parameters relevant to the 
particular physiological disorder being studied. Cell lines can also be derived from these animals 
for use as cellular models of the physiological disorder, or in drug screening. 

25 Rational drug design 

[0044] The goal of rational drug design is to produce structural analogs of biologically 
active polypeptides of interest or of small molecules with which they interact (e.g., agonists, 
antagonists, inhibitors) in order to fashion drugs which are, for example, more active or stable forms 
of the polypeptide, or which, e.g., enhance or interfere with the function of a polypeptide in vivo. 
30 Several approaches for use in rational drug design include analysis of three-dimensional structure, 
alanine scans, molecular modeling and use of anti-id antibodies. These techniques are well known 
to those skilled in the art. Such techniques may include providing atomic coordinates defining a 
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three-dimensional structure of a protein complex formed by said first polypeptide and said second 
polypeptide, and designing or selecting compounds capable of interfering with the interaction 
between a first polypeptide and a second polypeptide based on said atomic coordinates. 

[0045] Following identification of a substance which modulates or affects polypeptide 
activity, the substance may be further investigated. Furthermore, it may be manufactured and/or used 
in preparation, i.e., manufacture or formulation, or a composition such as a medicament, 
pharmaceutical composition or drug. These may be administered to individuals. 

[0046] A substance identified as a modulator of polypeptide function may be peptide or non- 
peptide in nature. Non-peptide "small molecules" are often preferred for many in vivo 
pharmaceutical uses. Accordingly, a mimetic or mimic of the substance (particularly if a peptide) 
may be designed for pharmaceutical use. 

[0047] The designing of mimetics to a known pharmaceutically active compound is a known 
approach to the development of pharmaceuticals based on a "lead" compound. This approach might 
be desirable where the active compound is difficult or expensive to synthesize or where it is 
unsuitable for a particular method of administration, e.g., pure peptides are unsuitable active agents 
for oral compositions as they tend to be quickly degraded by proteases in the alimentary canal. 
Mimetic design, synthesis and testing is generally used to avoid randomly screening large numbers 
of molecules for a target property. 

[0048] Once the pharmacophore has been found, its structure is modeled according to its 
physical properties, e.g., stereochemistry, bonding, size and/or charge, using data from a range of 
sources, e.g., spectroscopic techniques, x-ray diffraction data and NMR. Computational analysis, 
similarity mapping (which models the charge and/or volume of a pharmacophore, rather than the 
bonding between atoms) and other techniques can be used in this modeling process. 

[0049] A template molecule is then selected, onto which chemical groups that mimic the 
pharmacophore can be grafted. The template molecule and the chemical groups grafted thereon can 
be conveniently selected so that the mimetic is easy to synthesize, is likely to be pharmacologically 
acceptable, and does not degrade in vivo, while retaining the biological activity of the lead 
compound. Alternatively, where the mimetic is peptide-based, further stability can be achieved by 
cyclizing the peptide, increasing its rigidity. The mimetic or mimetics found by this approach can 
then be screened to see whether they have the target property, or to what extent it is exhibited. 
Further optimization or modification can then be carried out to arrive at one or more final mimetics 
for in vivo or clinical testing. 
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Diagnostic Assays 

[0050] The identification of the interactions disclosed herein enables the development of 
diagnostic assays and kits, which can be used to determine a predisposition to or the existence of 
a physiological disorder. In one aspect, one of the proteins of the interaction is used to detect the 
presence of a "normal" second protein (i.e., normal with respect to its ability to interact with the first 
protein) in a cell extract or a biological fluid, and further, if desired, to detect the quantitative level 
of the second protein in the extract or biological fluid. The absence of the "normal" second protein 
would be indicative of a predisposition or existence of the physiological disorder. In a second 
aspect, an antibody against the protein complex is used to detect the presence and/or quantitative 
level of the protein complex. The absence of the protein complex would be indicative of a 
predisposition or existence of the physiological disorder. 

Nucleic Acids and Proteins 

[0051] A nucleic acid or fragment thereof has substantial identity with another if, when 
optimally aligned (with appropriate nucleotide insertions or deletions) with the other nucleic acid 
(or its complementary strand), there is nucleotide sequence identity in at least about 60% of the 
nucleotide bases, usually at least about 70%, more usually at least about 80%, preferably at least 
about 90%, more preferably at least about 95% of the nucleotide bases, and more preferably at least 
about 98% of the nucleotide bases. A protein or fragment thereof has substantial identity with 
another if, optimally aligned, there is an amino acid sequence identity of at least about 30% identity 
with an entire naturally-occurring protein or a portion thereof, usually at least about 70% identity, 
more ususally at least about 80% identity, preferably at least about 90% identity, more preferably 
at least about 95% identity, and most preferably at least about 98% identity. 

[0052] Identity means the degree of sequence relatedness between two polypeptide or two 
polynucleotides sequences as determined by the identity of the match between two strings of such 
sequences. Identity can be readily calculated. While there exist a number of methods to measure 
identity between two polynucleotide or polypeptide sequences, the term "identity" is well known 
to skilled artisans {Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, 
New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic 
Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, 
H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, 



16 

G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M 
Stockton Press, New York, 1991). Methods commonly employed to determine identity between two 
sequences include, but are not limited to those disclosed in Guide to Huge Computers, Martin J. 
Bishop, ed., Academic Press, San Diego, 1994, and Carillo, H., and Lipman, D., SIAM J Applied 
5 Math. 48:1073 (1988). Preferred methods to determine identity are designed to give the largest 
match between the two sequences tested. Such methods are codified in computer programs. 
Preferred computer program methods to determine identity between two sequences include, but are 
not limited to, GCG (Genetics Computer Group, Madison Wis.) program package (Devereux, J., et 
al, Nucleic Acids Research 12(1)387 (1984)), BLASTP, BLASTN, FASTA (Altschul et al. (1990); 
1 0 Altschul et al. (1 997)). The well-known Smith Waterman algorithm may also be used to determine 
IM identity. 

[0053] Alternatively, substantial homology or similarity exists when a nucleic acid or 
h .i fragment thereof will hybridize to another nucleic acid (or a complementary strand thereof) under 
y I selective hybridization conditions, to a strand, or to its complement. Selectivity of hybridization 
y} 5 exists when hybridization which is substantially more selective than total lack of specificity occurs. 
J\ 5 Nucleic acid hybridization will be affected by such conditions as salt concentration, temperature, 
■ or organic solvents, in addition to the base composition, length of the complementary strands, and 
f»* the number of nucleotide base mismatches between the hybridizing nucleic acids, as will be readily 
appreciated by those skilled in the art. Stringent temperature conditions will generally include 

20 temperatures in excess of 30°C, typically in excess of37°C, and preferably in excess of 45 C. 
Stringent salt conditions will ordinarily be less than 1000 mM, typically less than 500 mM, and 
preferably less than 200 mM. However, the combination of parameters is much more important than 
the measure of any single parameter. See, e.g., Asubel, 1992; Wetmur and Davidson, 1968. 

[0054] The terms "isolated", "substantially pure", and "substantially homogeneous" are used 

25 interchangeably to describe a protein or polypeptide which has been separated from components 
which accompany it in its natural state. A monomeric protein is substantially pure when at least 
about 60 to 75% of a sample exhibits a single polypeptide sequence. A substantially pure protein 
will typically comprise about 60 to 90% WAV of a protein sample, more usually about 95%, and 
preferably will be over about 99% pure. Protein purity or homogeneity may be indicated by a 

30 number of means well known in the art, such as polyacrylamide gel electrophoresis of a protein 
sample, followed by visualizing a single polypeptide band upon staining the gel. For certain 
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purposes, higher resolution may be provided by using HPLC or other means well known in the art 

which are utilized for purification. 

[0055] Large amounts of the nucleic acids of the present invention may be produced by (a) 

replication in a suitable host or transgenic animals or (b) chemical synthesis using techniques well 
5 known in the art. Constructs prepared for introduction into a prokaryotic or eukaryotic host may 

comprise a replication system recognized by the host, including the intended polynucleotide 

fragment encoding the desired polypeptide, and will preferably also include transcription and 

translational initiation regulatory sequences operably linked to the polypeptide encoding segment. 

Expression vectors may include, for example, an origin of replication or autonomously replicating 
10 sequence (ARS) and expression control sequences, a promoter, an enhancer and necessary 
|>* processing information sites, such as ribo some-binding sites, RNA splice sites, polyadenylation 
P \ sites, transcriptional terminator sequences, and mRNA stabilizing sequences. Secretion signals may 

also be included where appropriate which allow the protein to cross and/or lodge in cell membranes, 
U1 and thus attain its functional topology, or be secreted from the cell. Such vectors may be prepared 
4 % 5 by means of standard recombinant techniques well known in the art. 

\ ^ [0056] The nucleic acid or protein may also be incorporated on a microarray. The 

- U preparation and use of microarrays are well known in the art. Generally, the microarray may contain 

the entire nucleic acid or protein, or it may contain one or more fragments of the nucleic acid or 
ft protein. Suitable nucleic acid fragments may include at least 17 nucleotides, at least 21 nucleotides, 
20 at least 30 nucleotides or at least 50 nucleotides of the nucleic acid sequence, particularly the coding 

sequence. Suitable protein fragments may include at least 4 amino acids, at least 8 amino acids, at 

least 12 amino acids, at least 15 amino acids, at least 17 amino acids or at least 20 amino acids. 

Thus, the present invention is also directed to such nucleic acid and protein fragments. 

25 EXAMPLES 

[0057] The present invention is further detailed in the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. Standard 
techniques well known in the art or the techniques specifically described below are utilized. 
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EXAMPLE 1 
Yeast Two-Hvbrid System 
[0058] The principles and methods of the yeast two-hybrid systems have been described in 
detail (Bartel and Fields, 1997). The following is thus a description of the particular procedure that 
5 we used, which was applied to all proteins. 

[0059] The cDNA encoding the bait protein was generated by PCR from brain cDNA. Gene- 
specific primers were synthesized with appropriate tails added at their 5' ends to allow 
recombination into the vector pGBTQ. The tail for the forward primer was 5*- 
GCAGGAAACAGCTATGACCATACAGTCAGCGGCCGCCACC-3' (SEQ ID NO:l) and the tail for the reverse 
1 0 primer was S'-acggccagtcgcgtggagtgttatgtcatgcggccgcta-S' (SEQ ID NO:2). The tailed 

U PCR product was then introduced by recombination into the yeast expression vector pGBTQ, which 

fl 

| is a close derivative of pGBTC (Bartel et al., 1996) in which the polylinker site has been modified 
^ to include Ml 3 sequencing sites. The new construct was selected directly in the yeast J693 for its 
yl ability to drive tryptophane synthesis (genotype of this strain: Mat a, ade2, his3, leu2, trpl, 
J? 5 URA3::GALl-lacZ LYS2::GAL1-HIS3 gal4del gal80del cyhR2). In these yeast cells, the bait is 
L produced as a C-terminal fusion protein with the DNA binding domain of the transcription factor 
JU Gal4 (amino acids 1 to 147). A total human brain (37 year-old male Caucasian) cDNA library 
jj[ cloned into the yeast expression vector pACT2 was purchased from Clontech (human brain 
Z t MATCHMAKER cDNA, cat. # HL4004AH), transformed into the yeast strain J692 (genotype of 
20 this strain: Mat a, ade2, his3, leu2, trpl , URA3 : : GAL 1 -lacZ LYS2::GAL1-HIS3 gal4del gal80del 
cyhR2), and selected for the ability to drive leucine synthesis. In these yeast cells, each cDNA is 
expressed as a fusion protein with the transcription activation domain of the transcription factor 
Gal4 (amino acids 768 to 881) and a 9 amino acid hemagglutinin epitope tag. J693 cells (Mat a 
type) expressing the bait were then mated with J692 cells (Mat a type) expressing proteins from the 
25 brain library. The resulting diploid yeast cells expressing proteins interacting with the bait protein 
were selected for the ability to synthesize tryptophan, leucine, histidine, and |3-galactosidase. DNA 
was prepared from each clone, transformed by electroporation into E. coli strain KC8 (Clontech 
KC8 electrocompetent cells, cat. # C2023-1), and the cells were selected on ampicillin-containing 
plates in the absence of either tryptophane (selection for the bait plasmid) or leucine (selection for 
30 the brain library plasmid). DNA for both plasmids was prepared and sequenced by di- 
deoxynucleotide chain termination method. The identity of the bait cDNA insert was confirmed and 
the cDNA insert from the brain library plasmid was identified using BLAST program against public 
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nucleotides and protein databases. Plasmids from the brain library (preys) were then individually 
transformed into yeast cells together with a plasmid driving the synthesis of lamin fused to the Gal4 
DNA binding domain. Clones that gave a positive signal after (3-galactosidase assay were considered 
false-positives and discarded. Plasmids for the remaining clones were transformed into yeast cells 
together with plasmid for the original bait. Clones that gave a positive signal after p-galactosidase 
assay were considered true positives. 

EXAMPLE 2 
Identification of ER-ak>ha/PN12364 Interaction 
[0060] A yeast two-hybrid system as described in Example 1 using amino acids 23 1-330 of 
ER-alpha (GB accession no. Ml 2674) as bait was performed. One clone that was identified by this 
procedure included amino acids 1-175 of PN12364. The DNA sequence and the predicted protein 
sequence for PN12364 are set forth in Tables 3 and 4, respectively. 

TABLE 3 

Nucleotide Sequence of PN12364 

gccaaccgcaatggaggctatggctgtgtatgtgtcaacggctggagtggagatgactgcagtgagaacattgatgattgtgccttcgcctcctgt 
actccaggctccacctgcatcgaccgtgtggcctccttctcttgcatgttcccagaggggaaggcaggtctcctgtgtcatctggatgatgcatgca 
tcagcaatccttgccacaagggggcattgtgtgacaccaaccccctaaatgggcaatatatttgcacctgcccacaaggctacaaaggggctgac 
tgcacagaagatgtggatgaatgtgccatggccaatagcaatccttgtgagcatgcaggaaaatgtgtgaacacggatggcgccttccactgtga 
gtgtctgaagggttatgcaggacctcgttgtgagatggacatcaatgagtgccattcagacccctgccagaatgatgctacctgtctggataagatt 
ggaggcttcacatgtctgtgccatgccaggtttcaaaggkgtgcattg (SEQ ID NO:3) 

TABLE 4 

Predicted Amino Acid Sequence of PN 123 64 
ANRNGGYGC VC VNGWS GDDC SENIDDC AF AS CTPGSTCIDRVASFS CMFPEGKAGLLCHL 
DDACISNPCHKGALCDTNPLNGQYICTCPQGYKGADCTEDVDECAMANSNPCEHAGKCVN 
TDGAFHCECLKGYAGPRCEMDINECHSDPCQNDATCLDKIGGFTCLCHARFORXAL (SEC- 
ID NO:4) 

EXAMPLE 3 
Identification of ER-beta/PN12365 Interaction 
[0061] A yeast two-hybrid system as described in Example 1 using amino acids 1-148 of 
ER-beta (GB accession no. X99101) as bait was performed. One clone that was identified by this 
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procedure included amino acids 1-217 of PN12365. The DNA sequence and the predicted protein 
sequence for PN12365 are set forth in Tables 5 and 6, respectively. 

TABLE 5 

Nucleotide Sequence of PN12365 

caacatcgagacccctgtgagaagaaccgctgccagaatggtgggacttgtgtggcccaggccatgctgggaaaagccacgtgccggtgtgc 
ctcagggtttacaggagaggactgccagtactcgacacctcatccatgctttgtgtctcgaccttgcctgaatggcggcacatgccatatgctcagc 
cgggatacctatgagtgcacctgtcaagtcgggtttacaggtaaggagtgccaatggaccgatgcctgcctgtctcatctctgtgcaaatggaagt 
acctgtaccactgtggccaaccagttctcctgcaaatgcctcacaggcttcacagggcagaagtgtgagactgatgtcaatgagtgtgacattcca 
ggacactgccagcttggtggcacctgcctcaacctgcctggttcctaccagtgccagtgccttcagggcttcacaggccagtactgtgacagact 
gtatgtgccctgtgcacactcgccttgtgtcaatggaggctcctgtcggcagactggtgacttcacttttgagtgcaactgccttccagagtatgaag 
agtgtaaggacctcataaaatttatgctgaggaatgagcgacagttcaaggaggagttcctgttctcgagcttgcactac (SEQ ID NO:5) 

TABLE 6 

Predicted Amino Acid Sequence of PN12365 

QHRDPCEKNRCQNGGTCVAQAMLGKATCRCASGFTGEDCQYSTPHPCFVSRPCLNGGTCH 
MLSRDTYECTCQVGFTGKECQWTDACLSHLCANGSTCTTVANQFSCKCLTGFTGQKCETD 
VNECDIPGHCQLGGTCLNLPGSYQCQCLQGFTGQYCDRLYVPCAHSPCVNGGSCRQTGDF 
TFECNCLPEYEECKDLIKFMLRNERQFKEEFLFS SLHY (SEQ ID NO:6) 

EXAMPLE 4 

Generation of Polyclonal Antibody Against Protein Complexes 
[0062] As shown above, ER-alpha interacts with PN12364 to form a complex. A complex 
of the two proteins is prepared, e.g., by mixing purified preparations of each of the two proteins. 
If desired, the protein complex can be stabilized by cross-linking the proteins in the complex, by 
methods known to those of skill in the art. The protein complex is used to immunize rabbits and 
mice using a procedure similar to that described by Harlow et al. (1988). This procedure has been 
shown to generate Abs against various other proteins (for example, see Rraemer et al., 1993). 

[0063] Briefly, purified protein complex is used as immunogen in rabbits. Rabbits are 
immunized with 100 ug of the protein in complete Freund's adjuvant and boosted twice in three- 
week intervals, first with 100 ug of immunogen in incomplete Freund's adjuvant, and followed by 
100 |xg of immunogen in PBS. Antibody-containing serum is collected two weeks thereafter. The 
antisera is preadsorbed with ER-alpha and PN12364, such that the remaining antisera comprises 
antibodies which bind conformational epitopes, i.e., complex-specific epitopes, present on the ER- 
alpha-PN12364 complex but not on the monomers. 
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[0064] Polyclonal antibodies against each of the complexes set forth in Tables 1-2 are 
prepared in a similar manner by mixing the specified proteins together, immunizing an animal and 
isolating antibodies specific for the protein complex, but not for the individual proteins. 

[0065] Polyclonal antibodies against the protein set forth in Tables 4 and 6 are prepared in 
a similar manner by immunizing an animal with the protein and isolating antibodies specific for the 
protein. 

EXAMPLE 5 

Generation of Monoclonal Antibodies Specific for Protein Complexes 
[0066] Monoclonal antibodies are generated according to the following protocol. Mice are 
immunized with immunogen comprising ER-alpha/PN12364 complexes conjugated to keyhole 
limpet hemocyanin using glutaraldehyde or EDC as is well known in the art. The complexes can 
be prepared as described in Example 4, and may also be stabilized by cross-linking. The 
immunogen is mixed with an adjuvant. Each mouse receives four injections of 10 to 100 u,g of 
immunogen, and after the fourth injection blood samples are taken from the mice to determine if the 
serum contains antibody to the immunogen. Serum titer is determined by ELISA or RIA. Mice 
with sera indicating the presence of antibody to the immunogen are selected for hybridoma 
production. 

[0067] Spleens are removed from immune mice and a single-cell suspension is prepared 
(Harlow et al., 1988). Cell fusions are performed essentially as described by Kohler et al. (1975). 
Briefly, P3.65.3 myeloma cells (American Type Culture Collection, Rockville, MD) or NS-1 
myeloma cells are fused with immune spleen cells using polyethylene glycol as described by Harlow 
et al. (1988). Cells are plated at a density of 2x1 0 5 cells/well in 96-well tissue culture plates. 
Individual wells are examined for growth, and the supernatants of wells with growth are tested for 
the presence of ER-alpha/PN12364 complex-specific antibodies by ELISA or RIA using ER- 
alpha/PN12364 complex as target protein. Cells in positive wells are expanded and subcloned to 
establish and confirm monoclonality. 

[0068] Clones with the desired specificities are expanded and grown as ascites in mice or 
in a hollow fiber system to produce sufficient quantities of antibodies for characterization and assay 
development. Antibodies are tested for binding to ER-alpha alone or to PN12364 alone, to 
determine which are specific for the ER-alpha/PN12364 complex as opposed to those that bind to 
the individual proteins. 
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[0069] Monoclonal antibodies against each of the complexes set forth in Tables 1-2 are 
prepared in a similar manner by mixing the specified proteins together, immunizing an animal, 
fusing spleen cells with myeloma cells and isolating clones which produce antibodies specific for 
the protein complex, but not for the individual proteins. 

[0070] Monoclonal antibodies against the protein set forth in Tables 4 and 6 are prepared 
in a similar manner by immunizing an animal with the protein, fusing spleen cells with myeloma 
cells and isolating clones which produce antibodies specific for the protein. 

EXAMPLE 6 

In vitro Identification of Modulators for Protein-Protein Interactions 
[0071] The present invention is useful in screening for agents that modulate the interaction 
of ER-alpha and PN12364. The knowledge that ER-alpha and PN12364 form a complex is useful 
in designing such assays. Candidate agents are screened by mixing ER-alpha and PN12364 (a) in 
the presence of a candidate agent, and (b) in the absence of the candidate agent. The amount of 
complex formed is measured for each sample. An agent modulates the interaction of ER-alpha and 
PN12364 if the amount of complex formed in the presence of the agent is greater than (promoting 
the interaction), or less than (inhibiting the interaction) the amount of complex formed in the 
absence of the agent. The amount of complex is measured by a binding assay, which shows the 
formation of the complex, or by using antibodies immunoreactive to the complex. 

[0072] Briefly, a binding assay is performed in which immobilized ER-alpha is used to bind 
labeled PN12364. The labeled PN12364 is contacted with the immobilized ER-alpha under aqueous 
conditions that permit specific binding of the two proteins to form a ER-alpha/PN12364 complex 
in the absence of an added test agent. Particular aqueous conditions may be selected according to 
conventional methods. Any reaction condition can be used as long as specific binding of ER- 
alpha/PN12364 occurs in the control reaction. A parallel binding assay is performed in which the 
test agent is added to the reaction mixture. The amount of labeled PN12364 bound to the 
immobilized ER-alpha is determined for the reactions in the absence or presence of the test agent. 
If the amount of bound, labeled PN12364 in the presence of the test agent is different than the 
amount of bound labeled PN12364 in the absence of the test agent, the test agent is a modulator of 
the interaction of ER-alpha and PN12364. 

[0073] Candidate agents for modulating the interaction of each of the protein complexes set 
forth in Tables 1-2 are screened in vitro in a similar manner. 
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EXAMPLE 7 

In vivo Identification of Modulators for Protein-Protein Interactions 
[0074] In addition to the in vitro method described in Example 6, an in vivo assay can also 
5 be used to screen for agents which modulate the interaction of ER-alpha and PN12364. Briefly, a 
yeast two-hybrid system is used in which the yeast cells express (1) a first fusion protein comprising 
ER-alpha or a fragment thereof and a first transcriptional regulatory protein sequence, e.g., GAL4 
activation domain, (2) a second fusion protein comprising PN12364 or a fragment thereof and a 
second transcriptional regulatory protein sequence, e.g., GAL4 DNA-binding domain, and (3) a 
10 reporter gene, e.g., P-galactosidase, which is transcribed when an intermolecular complex 
comprising the first fusion protein and the second fusion protein is formed. Parallel reactions are 
p performed in the absence of a test agent as the control and in the presence of the test agent. A 
p" functional ER-alpha/PN12364 complex is detected by detecting the amount of reporter gene 
f : expressed. If the amount of reporter gene expression in the presence of the test agent is different 
\ll5 than the amount of reporter gene expression in the absence of the test agent, the test agent is a 
5 *"~ modulator of the interaction of ER-alpha and PN12364. 

[0075] Candidate agents for modulating the interaction of each of the protein complexes set 
forth in Tables 1-2 are screened in vivo in a similar manner. 

f "20 [0076] While the invention has been disclosed in this patent application by reference to the 

details of preferred embodiments of the invention, it is to be understood that the disclosure is 
intended in an illustrative rather than in a limiting sense, as it is contemplated that modifications will 
readily occur to those skilled in the art, within the spirit of the invention and the scope of the 
appended claims. 
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